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Abstract 

How does dynamic price information flow among Northern European elec- 
tricity spot prices and prices of major electricity generation fuel sources? We 
use time series models combined with new advances in causal inference to 
answer these questions. Applying our methods to weekly Nordic and Ger- 
man electricity prices, and oil, gas and coal prices, with German wind power 
and Nordic water reservoir levels as exogenous variables, we estimate a causal 
model for the price dynamics, both for contemporaneous and lagged relation- 
ships. In contemporaneous time, Nordic and German electricity prices are 
interlinked through gas prices. In the long run, electricity prices and British 
gas prices adjust themselves to establish the equlibrium price level, since oil, 
coal, continental gas and EUR/USD are found to be weakly exogenous. 

Key words: Vector autoregression. Vector error correction. Electricity 
markets. Causal discovery, Non-Gaussianity, Directed acyclic graph. 
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1. Introduction 



There is an ongoing debate on the convergence of and price dynamics in 
energy markets, and electricity markets in particular. For the US market. 



Park et al. (2006) use advances in causal flow modelling and find that the 



dynamic relationships between electricity markets not only are governed by 
transmission lines, but also by different market structure and regulation. 
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Using similar techniques, the same authors indicate that the Canadian and 



US natural gas market is a single highly integrated market (Park et al. , 2008 ) 



Mjelde and Bessler (2009) go one step further, and investigate how weekly 



dynamic price information flows among major US electricity generation fuel 
sources: natural gas, uranium, coal and crude oil. They find that peak 
electricity prices move natural gas prices, which in turn influence crude oil 
prices. 

To our knowledge, price dynamics among Northern European electricity 
markets and their major fuel sources has not been closely looked into before. 



Zachmann (2008 ) rejects the hypothesis of full market integration of Northern 
European electricity markets. We will focus on the Nordic and German 
electricity markets (Weron, 2006 ). The Nordic electricity market (Nord Pool) 



is dominated by highly flexible hydro power (54% in 2007 (Fridolfsson and 



Tangeras, 2009)), and even though congestion within the Nord Pool area 
is not uncommon (Marckhoff and Wimschulte, 2009), we will consider the 



common Nordic system spot price here. The German EEX market, being 
the largest market in Europe, is on the other hand dominated by coal (47%) 
and nuclear power (23%) (Brunekreeft and Twelemann, 2005). Gas (17%), 
hydro and an increasing wind power production complement the picture. 
The EEX market is generally assumed to be less mature than the Nordic 



market ( 


Weron 


2006 


Weigt and von Hirschhausen 


Fridolfsson and Tangeras 2009 


)• 



2008; Miisgens, 2006 



We investigate the price dynamics between electricity prices and major 
fuel sources (oil, gas, coal) by estimating a causal model for the price dynam- 
ics, where Nordic water reservoir levels and German electricity production 
from wind mills are treated as exogenous variables. Mjelde and Bessler (2009 ) 
estimate a vector error correction model (VECM) for logarithmic prices. A 
directed acyclic graph (DAG) ( [Spirtes et al. , [2000 ) representing instantaneous 
causal influences is then found from the resulting contemporaneous correla- 



tion matrix, using the greedy equivalence search (GES) algorithm of Chick- 



ering (2003). 



Most causal DAG learning algorithms, including the GES algorithm, are 
based on the assumption that variables are jointly normally distributed. 
These methods share a fundamental problem: Several DAGs usually cor- 
respond to same joint distribution, so one only obtains an equivalence class 
of DAGs that are indistinguishable from data. While some directions of 
causal influences (edges in the DAG) may be the same for all DAGs in the 
equivalence class, usually many or most directions are left undetermined. 
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In the present paper, we rely on the assumption of non-normality, us- 
ing the hnear non-Gaussian acychc model (LiNGAM) recently developed by 
Shimizu et al. ( 2006a|[b ). This allows us to identify one single DAG. Because 



of this, we are also able to coherently integrate both contemporaneous and 
time-lagged causal relationships into the same DAG analysis. For our data, 
the GES algorithm is only able to identify undirected contemporaneous asso- 
ciations. The LiNGAM approach, on the other hand, provides instantaneous 
and time-lagged directed causal influences. 

2. Methods 

The three basic building blocks of our data analysis are the vector au- 
toregression (VAR) model, the vector error correction model (VECM) and 
the linear non-Gaussian acyclic model (LiNGAM) (Shimizu et al. , |2006a|[b ) 



We will now describe each in turn, before we combine them to estimate both 
instantaneous and lagged causal effects. 

2.1. Vector autoregression model 



The vector autoregression model (Hamilton, 1994) is a standard tool of 



econometrics and multivariate time series analysis. Let the endogenous vari- 
ables Xt and the exogenous variables Zt be observed random vectors depend- 
ing on (time) t = 1,2, . . .. The basic idea of the VAR model is that the 
endogenous variables depend linearly on their k previous values, as well as 
the current value of the exogenous variables, i.e. 

k 

Xt = tl + ^MrXt-r + lZt + et, (1) 

r=l 

where Mr and 7 are coefficient matrices of size n x n and n x d, respec- 
tively, where n is the number of endogenous variables and d is the number 
of exogenous variables. Further, ^ is a constant vector and is a vector of 
residuals (innovations). 

All variables must have the same order of integration. If all variables are 
stationary, /(O), we have the standard case of a VAR model. If all variables 
are non-stationary, I{d), d > 1, there are two possibilities. First, if the 
variables are not cointegrated, the variables must be differenced d times in 
order to obtain a VAR. Second, if the variables are cointegrated, we may use 
a vector error correction model (VECM). 
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2.2. Vector error correction model 

We here consider the case where the variables Xt are /(I), so that they 
are differenced one time in order to achieve stationarity. The vector error 
correction model (VECM) can be derived from the VAR model in ([T]), 



fc-i 



(2) 



where A is the difference operator (Aa^t = Xt — Xt-i), and is an n x n 
matrix relating changes in Xt for lagged r periods to current changes in Xt. 

The matrix 11 is called an error correction term, which compensates for 
the long-run information lost through differencing ( Juselius, 2006). 11 = a/3', 



where cx and f3 are of dimensions n x r, where the rank r is the number of 
cointegration relationships. The r linearly independent columns of f3 are the 
cointegrated vectors, each representing one long-run relationship between the 
series, and (3'xt-i is then stationary. 

If r = 0, the matrix 11 does not exist, and we have a VAR in difference, 
not a VECM. If we have full rank, r = n, it does not make sense to specify 
the model as a VECM, as the stationary /S.Xt in ([2]) will be equal to a non- 
stationary Ylxt-i plus some lagged stationary variables and so on, which is 



inconsistent (Juselius, 2006). 
Comparing (jlj) with ([2j) gives 



n = a/3' 



(3) 



and 



(4) 



i=T+l 



2.3. Linear non- Gaussian acyclic causal model 

In general, a linear causal model on the zero-mean (centered) random 
variables i = 1, . . . , m, can be defined by 



yi= ^ijyi 

k{i)<k{j) 



(5) 
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where the £jS are random noise terms and A; is a permutation over {1, . . . , m}. 
We interpret as a causal ordering of the variables, where later variables 
cannot cause earlier variables. Equation ([s]) can be represented as a di- 
rected acyclic graph (DAG) with vertices corresponding to Ui and edges cor- 
responding to a nonzero Estimating causal DAGs from observational 



data has received considerable interest in recent years (Pearl, 2000; Spirtes 
et al. , 2000). For continuous standard methods assume that the noise 



terms Si are jointly normally distributed, and use the estimated covariance 
matrix to infer the DAG. 

Several methods for inferring DAGs from Gaussian observational data 



have been proposed. To enable comparison of our results and those of Park 



et al. 


( 


2008 


), 


of 


Chickering 



we employ the Greedy Equivalence Search (GES) algorithm 
of Chickering (2003), as implemented in the software Tetrad IV (2004). The 
GES algorithm uses a score to evaluate how well a suggested DAG fits the 
data. Starting with an empty DAG, a greedy search over equivalence classes 
(defined below) is done. The search concludes when local maximum of the 
score is reached. 

A general problem with the normality-based methods such as the GES 
algorithm is that, even with an infinite amount of data, one cannot iden- 
tify a unique causal model (DAG), only a so-called Markov equivalence class 
consisting of several different DAGs corresponding to the same joint distri- 
bution. This is easily seen in the case of two variables yi and 5 where there 
is clearly no way of distinguishing between the models yi — )■ ?/2 and yi i— y2 
based on the covariance structure alone, and {yi — )■ y2, yi ^ 2/2} is a Markov 
equivalence class. With three variables, the Markov equivalence classes are 
{yi ^ 2/2 ^ 2/3,2/1 ^ 2/2 ^ 2/3,2/1 ^ 2/2 2/3} and {^1 2/2 ^ 2/3}- For an 



extensive discussion of this problem, see Shimizu et al. (2006b) 



In contrast, when assuming that the noise terms are independent and 
non-Gaussian, a unique causal structure is in fact identifiable. Equation ([s]) 
is then known as the LiNGAM (Shimizu et al. , 2006a). Writing ([s]) in matrix 
form: 

y = By + e (6) 

where y = {yi, . . . , ym)', £ = {^i, • • • , £m) and B is the (permutable to lower 
triangular) matrix of coefficients (3ij. The independence of the elements of 
£ implies that there are "no unobserved confounders" in the sense of Pearl 



(2000), so a causal interpretation is vahd (cf. Shimizu et al. (2006a), Section 
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2). Letting A 



(I-B) 



we can rewrite (|6| as 
Ae. 



y 



Since the variables in £ are independent and non-Gaussian, 



Independent Component Analysis (ICA) model (Comon 1994; Hyvarinen 



and Oja 2000). 



(7) 

([7|) defines the 



In ICA, the goal is to estimate both the so-called mixing matrix A and 
the independent components e. Essentially, in ICA we aim to find A and 
£ such that the entries of £ are as statistically independent as possible. By 
an argument based on the central limit theorem, this problem can also be 
posed as finding components which are as non-Gaussian as possible. Non- 
Gaussianity can be measured using the concept of entropy. The entropy of 
a random vector y with density / is defined as H{y) = — J f{y) log f{y)dy. 
Among random variables with a given variance, Gaussian variables have the 
highest possible entropy. Therefore, we can measure non-Gaussianity based 
on negentropy J, which is defined by J{y) = H{yg) — H{y), where y^ is a 
Gaussian random vector having the same covariance matrix as y. Clearly, 
J{y) is zero for Gaussian y and positive for non-Gaussian y. The iterative 



fixed-point algorithm fastICA (Hyvarinen, 1999) estimates A efficiently and 



robustly based on approximations to negentropy. 

It can be seen from ([T]) that both A and £ can only be estimated up to 
a scaling constant and a permutation. However, both the scaling and the 
permutation can be found in the application of ICA to LiNGAM, as shown 



by Shimizu et al. (2006a). After estimating A, the coefficient matrix B is 
immediately available as / — A^^. 

2.4- Combining instantaneous and lagged effects 
Our interest is here in the following model: 



T=0 



+ -yzt + £t 



(8) 



The difference between (|8]) and the VAR model defined in ([T]) is the inclusion 
of instantaneous causal effects Bq, where the matrix Bq corresponds to a 
DAG (i.e., can be permuted to strict lower triangularity) as in Section 2.3 
Bi,B2, . -. contain autoregressive effects, and their corresponding graphs 
may be cyclic. 

To estimate the model in ([8|, we customise the method described by 



Hyvarinen et al. (2008): 
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1. Estimate a VECM model for the data, see (|2]). We here obtain the 
coefficient matrices 11 and Fi, . . . , r^-i, together with and 7. 

2. Translate the estimated VECM coefficients into a VAR representation, 
see J3]) and Q. We then obtain the coefficient matrices Mi, . . . ,Mk 
in ff. 

3. Compute the residuals et, 



et = xt-p.-jzt-^ Mr 

T = l 



X 



t~T- 



(9) 



4. Perform the LiNGAM analysis on the residuals to find an estimate of 
the instantaneous effect matrix Bq. This matrix is a solution to the 
model 



et = BoSt + St. 



(10) 



See Section 12.31 for details. 
5. Compute the matrices of lagged causal effects, Br, r > 0, which are 
given as 



Br = {I- B,)Mr. 



(11) 



How do we find (11)? Equation ([8j) gives 

k 



[I - Bo)Xt = + ^ BrXt r + JZt + Et 



r=l 



This gives 



Xt 



[I - Bo)- V + - Bo)-'BrXt.r 



T=l 



I-Bo)-'7Zt + iI-Bo)-'et. 



(12) 



Comparing (|T2|) with Q, we find that {I-B^Y^Br = Mr for r > 1. Also, 
we see that (/ — BQ)^^St = et, which gives rise to (10). 
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2.5. Time-lagged causal flow and Granger causality 

One may ask whether a time-lagged causal flow is different from Granger 
causality. Granger causality is the ability to reduce the prediction error 
(Hamilton, 1994). Based on the VAR representation ([T]), a variable i Granger 



causes the variable j if at least one of the coefficients of M^- from xf}^ , t > 1, 
to a;SiV is significantly non-zero, since this reduces the prediction error in x 



Hyvarinen et al. (2008) proposed a combined definition of Granger causality: 
If at least one of the coefficients i), r > 0, is significantly non-zero, 



variable i causes j. See Hyvarinen et al. (2008) and Zhang and Hyvarinen 



(2009) for a more thorough discussion. 



3. Data 

We focus on the Nordic and German electricity markets, their major 
fuel sources (gas, coal, oil) and physical variables known to partly explain 
Nordic and German electricity prices; German wind power production and 
Nordic water reservoir levels. The data consist of 365 weekly observations 
of each variable from 2002-2008. Ideally, we should have used data further 
back. However, firstly, the wind data were not available until 2002. Secondly, 
six years is a long time in quite rapidly evolving and increasingly integrated 



European gas and electricity markets (Zachmann, 2008; Bunn and Gianfreda 



2010 Ruperez Micola and Bunn 2007). The markets were less mature further 



back, but if wind data had been available, we could have included 2001 data 
as well. 

All price series are given in or converted to EUR. Transforming all prices 



to a common currency (Hovanov et al. , 2004) could induce dependencies 
related to exchange rate fluctuations and not energy price fluctuations. For 
that reason, and since exchange rates may also influence commodity prices 
dChen and Chenl [2007] [Akramf [20091 [Zhang et alj [2008] ), we include the 
EUR/USD exchange rate as well. 

All price series are given as average^ over the week, since the producers 
try to maximise the accumulated income and buyers are likewise minimising 
their accumulated expenses. Had we instead considered, say, the hourly price 



^ Using weekly average spot prices might introduce additional correlation into the series 
or differenced price series (Working, 1960). Under some applications one might want to 



use daily observations to avoid additional complications induced by averaging. 
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at hour 24 each Sunday, our results would not necessarily say much about 
price information flow between the weekly price levels. 

An overview of the data is given in Table [T| and they are displayed in 
Figure [T}|3j 

We have included two gas prices here: Zeebriigge and NBP, represent- 
ing continental Europe and the United Kingdom, respectively. A few other 
gas markets are more relevant for the German electricity market than the 
Zeebriigge gas market, but the historic data period is then not long enough. 
The Zeebriigge and NBP gas markets are connected through the Bacton- 



Zeebriigge interconnector (Ruperez Micola and Bunn, 2007). We expect the 



Zeebriigge gas market to be more important than the NBP market for the 
electricity price formation, since it is closer to the German (and Nordic) 
market. 

We will treat German wind and Nordic reservoir levels as exogenous vari- 
ables in ([2]), which to some extent is debatable. The electricity market does 
not influence the wind itself, but may have contributed to the long term 
increase in wind power mills. Similarly, the electricity market does not in- 
fluence the inflow into water reservoirs, but the water reservoirs are ideally 
used when prices are high. Still, we find it most correct to treat these two 
variables as exogenous. 

All time series, except reservoir levels, were log-transformed. Since the 
reservoir levels are bounded by and 100%, they were logit-transformed. 
Next, all transformed variables except the oil price, were seasonally adjusted 
by subtracting a seasonal term 

A, ^ t /^f . n (Hf)-.,f cos (M), 

which was estimated by least squares regression. This was done in order to 
have variables that represent deviations from a normal level. 
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Data 

Nordic el. price 
German el. price 
Oil price 

Gas price 1 

Gas price 2 
Coal price 
EUR/USD 
Water Nordic 
Wind Germany 



Description 

Nord Pool system 

European Energy Exchange (EEX) 

Brent crude, International 

Petroleum Exchange (IPE) 

National balancing point 

(NBP), UK 

Zeebriigge, Belgium 

GIF ARA, Northwest Europe 

Exchange rate 

Reservoir levels, Norway + Sweden 
Electricity production, wind plants 



Resolution 

Weekly, average spot price 
Weekly, average spot price 
Weekly, average spot price 

Weekly, average spot price 

Weekly, average spot price 
Weekly, average physical price 
Weekly, average rate 
Values for each Monday 
Weekly, average production 



Table 1: Data overview. The data range from the first week of 2002 to the last week of 
2008, in total 365 weekly values for each of the variables. Since German wind production is 
not publicly available, the German wind production data were calculated by Point Carbon 
( |http://www.pointcarbon.comJ . 
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Figure 2: Weekly oil, coal and gas prices, and the EUR/USD exchange rate. All price 
series are given in Euro and indexed to 100 at Week 1 January 2002. 



12 




Reservoir levels, Norway+Sweden 

01 02 Q3 CM 01 02 03 04 01 02 03 04 01 02 03 OS 01 02 03 04 01 02 03 04 01 02 03 04 01 
2002 2003 2004 2005 2006 2007 2008 2009 



(a) Water reservoir levels, Nordic (Norway+Sweden). 



Wind Germany 




i"i"i"i"i"i"i"i"i"i"i"i"r'i"i"i"i"i"i"i"i"i"i"i"i"i"i"i"i 

Q1 02 03 04 Q1 02 03 04 01 02 03 04 01 02 03 04 01 02 03 04 01 02 03 04 01 02 03 04 01 
2002 2003 2004 2005 2006 2007 2008 2009 



(b) Electricity production from wind plants in Germany. 
Figure 3: Weekly values of exogenous variables. 
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4. Results 



4.1. Time series analysis 



Using the criteria of Akaike, Schwarz and Hannan and Quinn (Claeskens 



and Hjort, 2008), the optimal lag order of the unrestricted VAR with the two 
exogenous variables ([T]) was found to be two (Table |2|, even though Akaike's 
criterion was almost as good for three lags. Phillip-Perron unit root tests 
indicate that there is no evidence for stationarity of the series Oil, EURUSD 
and Coal. The same tests performed on the first differences indicate that 
these are stationary. Table [3] shows the test statistics and the p-values for 
each of the series. 



Model selection criterion 


Lag order 
1 2 3 4 5 


Akaike 
Hannan and Quinn 
Schwarz 


-38.96 -39.55 -39.54 -39.51 -39.38 
-38.65 -39.03 -38.81 -38.57 -38.23 
-38.19 -38.25 -37.71 -37.14 -36.48 



Table 2: Akaike, Schwarz and Hannan and Quinn model selection criteria for the optimal 
lag order of the VAR with a constant. 





Original data 
statistic p-value 


First differences 
statistic p-value 


ElNordic 
ElGermany 
Oil 

EURUSD 
Coal 
GasNBP 
GasZEE 


-3.6685 0.0050 
-4.5652 0.0002 
-1.5193 0.5229 
-2.1031 0.2437 
-1.3370 0.6132 
-3.1755 0.0223 
-3.0617 0.0304 


-14.5090 0.0000 
-26.8163 0.0000 
-15.2474 0.0000 
-14.9070 0.0000 
-10.8962 0.0000 
-23.8243 0.0000 
-23.2813 0.0000 



Table 3: Stationarity test using the Phillip-Perron unit root test: Test statistics for both 
the original time series (on log scale) and the first differences of the time series (on log 
scale), when testing for stationarity. The null hypothesis is that the series has a unit root, 
i.e. that they are non-stationary. 



We have also used another test of stationarity, the KPSS test (Kwiatkowsk 



et al. , 1992), where the null hypothesis is that each of the series is station- 



ary. All time series, except for ElNordic, are rejected at a 1% significance 
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level. ElNordic is rejected at a 5% significance level. The KPSS test on first 
differences indicate that all time series are level-stationary when differenced. 

We use the trace test to determine the number of cointegrating vectors, 
and the Schwarz criterion for determining whether the constant is within 
or outside the cointegration space. The trace test indicates that there are 
three cointegrated vectors at a 1% significance level, and the Schwarz cri- 
terion indicates that the constant is inside the cointegrating spac^ Table 
?? shows the test statistics and the critical values for different ranks when 
the constant is within the cointegration space. Table |5] shows the Schwarz 
criterion for the constant within and outside the cointegration space, when 
the cointegration rank is three. Since the cointegration tests can be sensitive 
to the lag structure of the VAR (Kasa, 1992), we have repeated the trace 
test for up to five lags in the VAR. In any case, and regardless of whether 
the constant is within or outside the cointegration space, we conclude that 
the number of cointegrating vectors is three. 



Table 4: Trace test of cointegration, when the constant is within the cointegration space. 



Rank 


Trace test 


Critical values 


(r) 


statistic 


10% 


5% 


1% 


r < 6 


3.72 


7.52 


9.24 


12.97 


r < 5 


13.92 


17.85 


19.96 


24.60 


r < 4 


29.39 


32.00 


34.91 


41.07 


r < 3 


59.77 


49.65 


53.12 


60.16 


r < 2 


102.23 


71.86 


76.07 


84.45 


r < 1 


172.97 


97.18 


102.14 


111.01 


r = 


280.18 


126.58 


131.70 


143.09 





Schwarz loss 


Constant within 
Constant outside 


-3487.45 
-3482.29 



Table 5: The Schwarz loss when the constant is inside and outside the cointegration space, 
for the case of three cointegrated vectors. 



^The Schwarz criterion indicates that there are four cointegrating vectors, but we pro- 
ceed with the trace test's conclusion. 
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A cointegrating vector is a stationary linear combination of possibly non- 
stationary vector time-series components. This combination might consist 
of only one of the series, which then must be stationary. It is interesting to 
test if this is the case, especially since the Phillip-Perron test suggests that 
several of the series are stationary. Table |6] shows the p-values of this test 
applied to each of the series, whose null hypothesis is that the series is by 
itself one of the cointegrating vectors. We see that all series are rejected, 
indicating that none of the cointegrating vectors consist of only one of the 
series. 





P-value 


ElNordic 


0.0005 


ElGermany 


0.0000 


Oil 


0.0000 


EURUSD 


0.0000 


Coal 


0.0000 


GasNBP 


0.0002 


GasZEE 


0.0001 



Table 6: Test of whether a series by itself is one of the cointegrating vectors. 

After fitting a VEC model with rank three to our data, see ([2]), we perform 
a normality test on the residuals. We use the Jarque-Bera test which tests 
for normality in both the univariate and multivariate case. The test rejects 
the null hypothesis of normality for each univariate series and for the multi- 
variate case as well. In addition, an investigation of the residuals showed no 
significant auto-correlation between the residuals. Thus, the assumption of 
independent and non-Gaussian residuals is not unreasonable, and LiNGAM 
can be used. 

A weak exogeneity test is performed, which tests the null hypothesis 
that each of the series does not respond to disturbances or shocks in the 
cointegration space, i.e. that the series is unresponsive to the deviations from 
the long-run relationships. This test is performed on ck, more specifically, 
for one particular series, we test whether the corresponding row in ex. (and 
hence in 11) is zero. 

Further, an exclusion test is performed, which tests the null hypothesis 
that a particular series is not in the cointegration space. This test is per- 
formed on /3, also here testing for a zero row. For more details on tests on 



cx and /3, see e.g. Juselius (2006). 
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Table [7] shows the p-values for both the weak exogeneity test and the 
exclusion test. We see that ElNordic, ElGermany and GasNBP are rejected 
at a 3% or lower significance level in the weak exogeneity test, meaning 
that the long-run relationships in the data are important for these series, 
whereas for the other series there is not evidence for this. In the exclusion 
test, ElNordic, ElGermany, EURUSD, GasNBP and GasZEE are rejected. 
Hence, there is strong evidence that these series are included in the long- 
run relationships. An exclusion test is also performed on the constant term, 
which results in a rejection of the null hypothesis at a 2% significance level. 
This agrees with the Schwarz conclusion in Table [5j 



ElNordic 
ElGermany 
Oil 

EURUSD 
Coal 
GasNBP 
GasZEE 



Weak exogeneity 
0.0065 
0.0000 
0.3722 
0.8023 
0.5012 
0.0298 
0.1866 



Exclusion 
0.0001 
0.0000 
0.1458 
0.0019 
0.1388 
0.0000 
0.0000 



Table 7: The p-values for the weak exogeneity test and the exclusion test, whose null 
hypothesis is that a particular series does not respond to shocks in the cointegration 
space, and that a particular series is not in the cointegration space, respectively. 

Figure |4] displays the impulse responses for all series, i.e. the responses 
of each series to a shock in each series. Each column shows the up to ten 
week responses of all series caused by an impulse (a one-time-only shock) in 
one of the series (the column headers show the impulses, whereas the row 
headers show the responses). The responses are normalised so that they can 
be compared with each other. 

In order to get the impulse responses, the causal ordering among vari- 
ables is needed. For Figure |4], we have followed the standard approach and 
used Bernanke ordering (Bernanke, 1986). The innovations are written as a 



function of more fundamental, internally orthogonal sources of variation, Ut, 

given by = A Vf, where A is a matrix representing how the innovations 
Bt are caused by orthogonal variation in each variable. Alternatively, we 
could here have used our LiNGAM based ordering. 

As seen on the diagonal, all series respond positively to their own shocks, 
and except for EURUSD, these responses are also strong. ElGermany re- 
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spends quickly and strongly to shocks in ElNordic, whereas there is not 
much impulse response the other way around. GasNBP and GasZEE have 
a slowly increasing response to a shock in ElNordic, whereas they respond 
much quicker to a shock in ElGermany. 

ElNordic is mostly affected by impulses in GasNBP and GasZEE, and 
also here, these responses are slowly increasing over time. Besides being 
affected by impulses in ElNordic, ElGermany is also affected by impulses in 
GasNBP and GasZEE. The response caused by a shock in GasNBP is much 
quicker for ElGermany than for ElNordic. Further, GasZEE is more affected 
by a shock in GasNBP than the other way around. 

Finally, we see that Oil, EURUSD and Coal are neither causing any 
significant responses in the other series, nor responding to shocks in any of 
the other series. 



4-2. Learning contemporaneous and time-lagged causal DAGs 

In the following we investigate the instantaneous causal (-Bq) and lagged 
{Bi and B2) effects. 

The time series were standardised before the following DAG analysis, en- 
abhng direct comparison of the strengths of causal effects. -Bq, Bi and 
B2 were then estimated as described in Section 'LA Insignificant edges 
of Bq were removed using the resampling method described in Section 6.3 

the time-lagged 



2.4 



of Shimizu et al. (2006a). As seen in (11) in Section 
effects Bt^, t = 1,2, depend on both Bq and the matrix Mr of pure au- 
toregressive effects. Therefore, the resampling method used for Bo is not 
available for the time-lagged effects, and it is not clear from Equation (11) 
how we could assess significance e.g. using p-values. However, since the data 
are standardised, we may simply use a cutoff in effect size as our significance 
threshold. We have chosen to remove all effects from Bi and B2 that are 
smaller in absolute value than the 70% absolute value quantile of all the 
elements in Bi. 

To illustrate the advantages of the use of the LiNGAM methodology, we 
show instantaneous effects estimated using the GES algorithm, as imple- 
mented in Tetrad IV (2004). The results are shown in the partially directed 



acyclic graph (PDAG) in Figure [5j The PDAG shows the entire equivalence 
class as a single graph. Having a directed edge in the PDAG means that this 
edge has the same orientation for all DAGs in the equivalence class. Undi- 
rected edges in the PDAG have different orientations for different members 
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EINordic EIGermany Oil 



Impulses 

EURUSD Coal GasNBP GasZEE 



EINordic 





Figure 4: Impulse response plot: Each column shows the up to ten week responses in all 
series to a one-time-only shock in the series listed in the column header. 



of the Markov equivalence class. Note that the PDAG in Figure |5] is com- 
pletely undirected, so no directions of causal influences can be determined 
in this case. Figure |5] shows an association between the coal price and the 
EURUSD. No price information seems to flow to or from the oil price, while 
the Nordic and German electricity prices seem to be connected through the 
two gas prices. 

Figure [6] shows the graphical representation of Bq, estimated using LiNGAM, 
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Figure 5: The instantaneous causal effects obtained using the GES algorithm in Tetrad 
rv|(l2004l). 



as described in Section 2^ Most of these instantaneous effects are intuitively 
reasonable. The main difference between the DAG Bq and the PDAG ob- 
tained using the GES algorithm in Figure [5] is that the latter lacks directions 
of the edges. Again, information does not flow to/from the oil price. As 
expected, the arrow goes from EURUSD to coal prices. Information flows 
from GasZEE to GasNBP, ElNordic and ElGermany. This is partly surpris- 
ing, but we should keep in mind that the Nordic reservoir levels and German 
wind have already been accounted for in the model, and it might be that, 
contemporaneously, GasZEE plays an important role. 




Figure 6: Bq: The instantaneous causal effects. 
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Figure 7: Bi: The causal effects witli lag one. The smallest effects have been removed- 




Figure 8: B2: The causal effects with lag two. The smallest effects have been removed. 

Figure [7] and |8] show the graphical representations of Bi and B2, respec- 
tively. Note that these graphs are directed, but cyclic, so they are not DAGs. 
This is natural for time-lagged relationships. We see that all variables influ- 
ence themselves at time lag one, and that ElNordic, EURUSD, Coal and Oil 
even influence themselves at time lag two. 

At lag one (-Bi), ElGermany is (mainly) influenced directly and indi- 
rectly by EURUSD and GasZEE, indirectly by Oil, and directly by Coal and 
ElNordic. GasNBP is influenced by GasZEE (and GasNBP itself), but influ- 
ences nothing else. Note, however, that some of the effects are quite small, 
except for the EURUSD Coal, EURUSD ^ ElGermany and GasZEE ^ 
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GasNBP relationships. At lag two (-B2), there are fewer strong effects, except 
that EURUSD seems to play an important role. 



5. Discussion 



Using time series models combined with new advances in causal infer- 
ence, we have studied how dynamic price information flows among Northern 
European electricity spot prices and prices of major electricity generation 
fuel sources. Applying our methods to weekly Nordic and German electric- 
ity prices, and oil, gas and coal prices, as well as German wind power and 
Nordic water reservoir levels, we have estimated a causal model for the price 
dynamics, both for contemporaneous and lagged relationships. 

We find that the oil price, coal price and EUR/USD exchange rate are 
non-stationary, while Nordic and German electricity prices, as well as British 
and Zeebriigge gas prices are stationary. Our results can be compared with 



the results from Mjelde and Bessler (2009), who study the US market, even 



though we have treated Nordic water reservoir levels and German wind power 
as exogenous variables. There are a few noteworthy similarities and differ- 
ences. [Mjelde and Bessler] include both peak, and off-peak prices, while we 



consider base prices. Note, however, that the peak/off-peak difference in the 
Nordic electricity market is less pronounced due to the very flexible hydro 
power. Contrary to Mjelde and Bessler we find only positive innovation 
shock responses, for example from natural gas to coal, where there is a nega- 
tive response in the US study. We both find a strong connection between gas 
and electricity prices. In contemporaneous time, we find a causal link from 
(Zeebriigge) gas prices to the electricity markets, while the US study gives 
the opposite conclusion. We find that coal and EURUSD together stand 
alone in contemporaneous time. In the US study, where the exchange rate is 
not included in the analysis (since all prices are in USD), coal stands alone 
in contemporaneous time. We find that even oil stands alone in contempo- 
raneous time, which could be explained by the difference in European and 
US gas markets (Hobaek Haff et al. , 2008), even though they may converge 



due to the increase in liquefied natural gas trade (Neumann, 2009). As with 



Mjelde and Bessler (2009), we find that all price series are cointegrated with 
a few cointegrating vectors (three in our case). At longer horizons, electricity 
prices and British gas prices adjust themselves to establish the equilibrium 
price level, since oil, coal, continental gas and EUR/USD are found to be 
weakly exogenous. In our analysis, however, and contrary to the US study. 
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the exclusion test casts some doubt on whether the oil and coal prices are 
part of the cointegrating space. 

Generally, the British gas prices are not important for the electricity 
markets when the Zeebriigge gas price is included, which is expected, since 
the Zeebriigge gas market is closer to the electricity generation and grid. The 
fact that coal prices do not play an important role in contemporaneous time 
in our analysis, while gas does, could first of all be because we have employed 
the more liquid GIF ARA price, while local producers may pay a different 
price, which may also partly be the case for the Zeebriigge gas prices. Second, 
the coal price has a low volatility compared to the gas and electricity prices, 
and naturally reacts more slowly to peak demand, since the coal prices' 
influence is affected by transportation time and costs. Third, there has been 
speculation that the oil and gas markets in Europe are decoupling (see e.g. 



Panagiotidis and Rutledge (2007)), which could also partly explain why the 



oil and gas prices play different roles in this Northern European commodity 
price game. 

In our view, there are two main methodological advantages of our ap- 



proach, as compared to previous work (Mjelde and Bessler, 2009 Park et al. 
2006, 2008). First, we are able to identify one unique contemporaneous 



graph, as opposed to a Markov equivalence class (which might be large). 
Second, we are able to properly and coherently deal with both instantaneous 
and time-lagged effects in the same analysis. Park et al. (2006) (p. 97) state 
that "in contrast to the directed graph analysis, forecast error variance de- 
composition and impulse response functions allow for analysis of dynamic 
information flows over time", i.e. in their view, DAGs are only applicable for 
analysing instantaneous effects. We have shown that DAGs are in fact useful 
for combining time-lagged and instantaneous effects. 

Implicit in our premise of statistically independent errors/residuals is the 
assumption of having no unobserved confounders: Any unmeasured common 
cause of any two of our variables would skew our results and create a depen- 



dence. It is possible to include latent variables in the LiNGAM model (Hoyer 



et al. , 2008), but we have seen this as out of the scope of our paper, due to 



the added complications of dealing with time series data. 

Our approach is a flrst attempt at a causal model for the price dynamics, 
and can be improved in many ways. Future work could include non-linear 
causal discovery (Hoyer et al. , 2009 ), incorporating possible effects of stochas- 
tic volatility and investigating the price dynamics on a flner time scale, for 
example with daily instead of weekly price series. 
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